Automatic Discovery of Similar Words

نویسندگان

  • Pierre Senellart
  • Vincent D. Blondel
چکیده

We deal with the issue of automatic discovery of similar words (synonyms and near-synonyms) from different kind of sources: from large corpora of documents, from the Web, and from monolingual dictionaries. We present in detail three algorithms that extract similar words from a large corpus of documents and consider the specific case of the World Wide Web. We then describe a recent method of automatic synonym extraction in a monolingual dictionary. The method is based on an algorithm that computes similarity measures between vertices in graphs. We use the 1913 Webster’s Dictionary and apply the method on four synonym queries. The results obtained are analyzed and compared with those obtained with two other methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...

متن کامل

The Semantic and Rhetorical Function of the Synonymous and Antonymous Concepts of “Infaq” in the Holy Quran

The syntagmatic (descriptive) semantic approach is an attempt to represent the words and their relations existing in the human mind. Considering this idea, the present paper, while applying this approach, seeks to provide a descriptive analysis of the concept of infaq and to explain the semantic and rhetorical function of the concepts that having a syntagmatic relation with it are sometimes use...

متن کامل

A WordNet-Based Near-Synonyms and Similar-Looking Word Learning System

Near-Synonyms and Similar-Looking (NSSL) words can create confusion for English as Foreign Language Learners as a result of a type of lexical error that often occurs when they confuse similar-looking words that are near synonyms to have the same meaning. Particularly, this may occur if the similar-looking words have the same translated meaning. This study proposes a method to find these NSSL wo...

متن کامل

A Survey Paper on Concept Mining in Text Documents

1. Berry Michael W., (2004), “Automatic Discovery of Similar Words”, in “Survey of Text Mining: Clustering, Classification and Retrieval”, Springer Verlag, New York, LLC, 24-43 2. Navathe, Shamkant B., and Elmasri Ramez, (2000), “Data Warehousing and Data Mining”, in “Fundamentals of Database Systems”, Pearson Education pvtInc, singapore, 841-872. 3. HaralamposKaranikas and BabisTheodoulidis Ma...

متن کامل

Human-Yeast Hybrids: New Visions to Genetic Disorders and Drug Discovery

Yeast has been a very helpful organism for centuries, especially with respect to fermentation of sugars and production of bread. However, for an even longer time, yeast has been a distant relative of humans having diverged from a common ancestor, about one billion years ago. More than one third of the yeast genes have human counterparts, despite this evolutionary distance. Yeast and human ortho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003